Text Categorization from Category Name via Lexical Reference
نویسندگان
چکیده
Requiring only category names as user input is a highly attractive, yet hardly explored, setting for text categorization. Earlier bootstrapping results relied on similarity in LSA space, which captures rather coarse contextual similarity. We suggest improving this scheme by identifying concrete references to the category name’s meaning, obtaining a special variant of lexical expansion.
منابع مشابه
Dynamic Categorization of Semantics of Fashion Language: A Memetic Approach
Categories are not invariant. This paper attempts to explore the dynamic nature of semantic category, in particular, that of fashion language, based on the cognitive theory of Dawkins’ memetics, a new theory of cultural evolution. Semantic attributes of linguistic memes decrease or proliferate in replication and spreading, which involves a dynamic development of semantic category. More specific...
متن کاملFeature-Semantic Gradients in Lexical Categorization Revealed by Graded Manual Responses
Participants performed a categorization task in which basiclevel animal names (e.g., cat) were assigned to their superordinate categories (e.g., mammal). Manual motor output was measured by sampling computer-mouse movement while participants clicked on the correct superordinate category label, and not on a simultaneously presented incorrect category. Animal names were selected from the concept-...
متن کاملCategorizing Local Contexts as a Step in Grammatical Category Induction
Building on the use of local contexts, or frames, for human category acquisition, we explore the treatment of contexts as categories. This allows us to examine and evaluate the categorical properties that local unsupervised methods can distinguish and their relationship to corpus POS tags. From there, we use lexical information to combine contexts in a way which preserves the intended category,...
متن کاملCategorical Information in Pharmaceutical Terminologies
Drug information sources use category labels to assist in navigating and organizing information. Some category labels describe drugs from multiple perspectives (e.g., both structure and function). The National Drug File - Reference Terminology (NDF RT) is a drug information source that augments a "legacy" categorization system with a formal reference model specifying Chemical Structure, Cellula...
متن کاملL2 Learners’ Lexical Inferencing: Perceptual Learning Style Preferences, Strategy Use, Density of Text, and Parts of Speech as Possible Predictors
This study was intended first to categorize the L2 learners in terms of their learning style preferences and second to investigate if their learning preferences are related to lexical inferencing. Moreover, strategies used for lexical inferencing and text related issues of text density and parts of speech were studied to determine their moderating effects and the best predictors of lexical infe...
متن کامل